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Abstract. The H-iree process starts with the empty graph on n vertices and adds 
edges chosen uniformly at random, one at a time, subject to the condition that no 
copy of H is created, where H is some fixed graph. When H is strictly 2-balanced, we 
show that for some c,d > 0, with high probability as n ^ c», the final graph of the 
H-free process contains no subgraphs F on vf ^ n'^ vertices with maximum density 
^ ■ maxjcFjej/u,/} > c. This extends and generalizes results of Gerke and Makai for the 

Ca-free process. 
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1 Introduction 

o' 

Almost fifty years ago, Erdos and Renyi [7j introduced the random graph process G{n,i). This 
■ starts with the empty graph on n vertices and adds i edges one by one, where each edge is chosen 

I uniformly at random among all edges not yet present. In their seminal 1960 paper [5], the first 

problem they studied is the so-called small subgraphs problem. Given a fixed graph F with ep edges 
and vp vertices, it asks whether G{n,i) whflll contains a copy of F as a subgraph or not. It took 
! twenty years until Bollobas [1] solved this problem in full generality, showing that the so-called 

' maximum density m{F) is the crucial parameter that essentially determines the appearance of F. 

o ; 

fSJ , Theorem 1. [4J Let F be a fixed non-empty graph. Then 

. where m{F) := max{ej /vj \ J Q F and vj > 1}. 

In a related model it took nearly thirty years to solve the small subgraphs problem. Random d- 
\ regular graphs Gn,d have been studied since around 1980, cf. [21], but only in 2007 Kim, Sudakov and 

^ ■ Vu [To] established the analogue of Theorem [1] for Gn^', again m{F) turns out to be the important 

quantity. 

In this paper we consider a natural variant of the classical random graph process, which has recently 
attracted a lot of attention. Given some fixed graph H, this process also starts with an empty graph 
and then add edges one by one, but each new edge is now chosen uniformly at random subject to 
the condition that no copy of H is formed. This so-called H-free process was suggested by Bollobas 
and Erdos [3j at a conference in 1990, and it was first described in print in 1995 by Erdos, Suen 
and Winkler [6j, who asked how many edges the final graph typically has. In 2001 Osthus and 
Taraz |llj answered this basic question up to logarithmic factors for the class of strictly 2-balanced 
graphs, i.e. where H satisfies vh^^h ^ 3 and for all proper subgraphs K oi H with vk > 3 vertices 
we have 

VK-2 VH-2 ^ 
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Many interesting graphs are strictly 2-balanced, including cycles C^, complete graphs K^, complete 
r-partite graphs Kt^...^t and the d-dimensional cube. Only in a breakthrough in 2009, Bohman [T] was 
able to close the logarithmic gap mentioned above for the Cs-free process, giving (up to constants) 
matching upper and lower bounds for the final number of edges. Subsequently several additional 
special cases have been settled, see e.g. [I2l [I3l [TSl [18]. Very recently, the final number of edges 
in the C^-free process was resolved in [T^, giving the first matching bounds for a non-trivial class 
of graphs. As one can see, much research has been devoted to understanding the combinatorial 
structure of the final graph of the iJ-free process, but so far even very basic properties are not well 
understood. 

The main focus of the present work is the small subgraphs problem in the final graph of the H-hee 
process, where H is strictly 2-balanced. An intriguing consequence of the recent analysis of Bohman 
and Keevash [2] is that although the graph G{i) produced by the if-free process after adding i edges 
contains no copies of H, during some initial phase the number of small subgraphs in G{i) and the 
unconstrained G{n, i) are roughly the same (for the Cs-free process similar results were obtained by 
Wolfovitz \16\ I17j). Regarding subgraph containment, their results imply the following statement. 
Recall that m{F) denotes the maximum density of a graph F. 

Theorem 2. Let H be a strictly 2-balanced graph. Suppose that F is a fixed non-empty H-free 
graph. Then there exists ^ = ^,{F,H) such that for m := ^n^~"^/'^^*^^''(logn)^/*^'^^~^'' we have 



Unfortunately, the results in [2j only hold during the first m steps of the //-free process, which 
motivates further investigation of the evolution in later steps. Since precise structural properties of 
the final graph are not known so far, we do not ask whether an analogue of Theorem [T] also holds 
in the later evolution, but restrict our attention to the basic question whether fixed H-fiee graphs 
F satisfying, say, m{F) ^ d2{H) appear or not. For the special case H = C3 this has recently 
been addressed by Gerke and Makai [8]: they proved that for some c > 0, whp fixed graphs F with 
ep/vp > c do not appear in the Cs-free process. We would like to remark that such a behaviour is 
not necessarily true for all constrained graph processes. For example, Gerke Schlatter, Steger and 
Taraz ^ showed that in the random planar graph process (where random edges are added subject 
to the condition of maintaining planarity) every fixed planar graph appears whp. This raises the 
question what behaviour the general //-free process exhibits. 

1.1 Main result 

In this paper we prove that for strictly 2-balanced H, the //-free process contains whp no copies of 
sufficiently dense graphs, even if their sizes grow moderately in n. In fact, we obtain a new result 
for the final graph of the //-free process regarding the number of edges in every small subset of the 
vertices. As usual, given A CI [n], we write e{A) for the number of edges joining vertices in A. 

Theorem 3. For every strictly 2-balanced graph H there exist c,d>0 such that whp in the final 
graph of the H-free process we have e{A) < c\A\ for all A C [n] with 1 < |^| < n'^. 

This estimate is in contrast to most known results, which only hold during some initial number of 
steps. We immediately deduce the following statement regarding the small subgraphs problem in 
the final graph of the //-free process, which complements the results in |2j (see e.g. Theorem [2]). 
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Corollary 4. For every strictly 2-balanced graph H there exist c,d>0 such that whp the final graph 
of the H-free process contains no copy of any graph F with 1 < vp ^ n'^ vertices and m{F) > c. □ 

Up to constants the bound m{F) > c is best possible, since the results of Bohman and Keevash [2j 
imply that H-hee graphs F with vp = 0{1) vertices and m{F) < d2{H) do appear in the H-free 
process (cf. Theorem [2]). For the special case H = C3, Gerke and Makai [8j have previously obtained 
a similar result for fixed graphs F. So, Corollary |4] not only generalizes the main result of [8], but 
moreover demonstrates that whp dense graphs F never appear in the H-free process, also if their 
number of vertices grow moderately in n. In fact, we believe that fixed graphs with maximum 
density strictly larger than d2{H) do not appear in the H-free process. 

Conjecture 5. Let H be a strictly 2-balanced graph and suppose that F is a fixed non-empty graph 
satisfying m{F) > d2{H). Then whp the final graph of the H-free process contains no copy of F. 

We now outline our strategy for proving Theorem [3l Intuitively, we show that whp for every 
possible placement of [c|A|] edges inside some set A C [n] satisfying \A\ < n'^, already after the 
first m steps there exists a 'witness' which certifies that not all of these edges can appear in the 
H-free process. The same basic idea was used in [8J, but the main part of their argument is 
tailored towards the (simpler) Cs-free case (in fact, a similar idea has also previously been used for 
bounding the independence number of the H-free process in [H [2]). By contrast, our argument is 
for the more general H-free process, where H is strictly 2-balanced, and one important ingredient 
are the estimates obtained by Bohman and Keevash [2j. For the sake of simplicity and clarity of 
presentation, we have made no attempt to optimize the constants obtained in our proof, and we 
also omit fioor and ceiling signs whenever these are not crucial. 

2 Preliminaries 

In this section we introduce our notation and review properties of the H-free process. We closely 
follow [2], and the reader familiar with these results may wish to skip this section. 

2.1 Constants, functions and parameters 

In the remainder of this paper we consider a fixed strictly 2-balanced graph H. We first choose e 
and then fj, small enough such that, in addition to the implicit constraints in [2j for H, we have 

""^'^{^■Wh)} 2e„(2,)"-<e. (1) 

So £ and ^ are absolute constants (depending only on H)^ since the additional constraints in [2j 
only depend on H. Writing avX{H) for the number of automorphisms of H, following [2J we define 

p.^^-i/d,{H) ^ m:=//nVlogn)i/(^«-i) and (?(t) := e-^-^^"*^^)''^^*^^"' . (2) 

For every i < m we set t = t{i) := i/{n'^p), but will just write t if there is no danger of confusion. 

2.2 Terminology and notation 

Let G{i) denote the graph with vertex set [n] = {1, . . . ,n} after i steps of the H-free process. Its 
edge set E{i) contains i edges and we partition the remaining non-edges ('2^) \ E{i) into two sets 
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0{i) and C{i) which we cah open and closed pairs, respectively. We say that a pair uv of vertices is 
closed in G{i) if G{i) U {uv} contains a copy of H. Observe that the H-hee process always chooses 
the next edge Cj+i uniformly at random from 0{i). In addition, for uv € 0{i) we write Cuvii) for 
the set of pairs xy G 0{i) such that adding uv and xy to G{i) creates a copy of H containing both 
uv and xy. Note that uv G 0{i) would become closed, i.e. belong to C{i + 1), if Cj+i € Cuv{i)- We 
remark that in contrast to [2j we work only with sets of unordered pairs. 



2.3 Previous results for the H-tree process 



Using Wormald's differential equation method |191l20j. Bohman and Keevash [2] track a wide range 
of variables throughout the first m steps of the H-fiee process, where H is strictly 2-balanced. Prom 
this they deduce their remarkable lower bound on the final number of edges. For our argument the 
key properties are estimates for the number of open and closed pairs and certain density statements. 
The following theorem conveniently summarizes these in a (highly) simplified form. 

Theorem 6. [2\ Suppose H is strictly 2-balanced. Set /3h '■= eni&H — l)/aut(i/). Let Hj denote 
the event that for every n^p < i < j, in G{i) we have 



\0{i)\<q{t)n' , 

\Cuv{^\> Ph ■ {2tY"-h{t)p-^ 

\Guv{i)^Gu'v'{^\<n-^'^»p-^ 

e{A) < max{8e"^|yl|,p|ylpn^^} 

Then Hm holds whp in the H-free process. 



(3) 

for all pairs uv € 0{i) , (4) 
for all distinct uv, u'v' £ 0{i) and (5) 
for all sets A ^[n] . (6) 

□ 



Both ([3|) and ([6|) follow readily from Theorem 1.4 and Lemma 4.2 in [2]. Corollary 6.2 and Lemma 8.4 
in [2] give and ([5]) by elementary considerations (the 'high probability events' in fail with 
probability at most n~'^*^^\ so there is no problem in taking a union bound over all steps and pairs). 

It should be noted that Theorem [6] does not directly imply our main result. The important difference 
here is that ([6]) is only valid during the first m steps, whereas Theorem [3] holds in the final graph of 
the H-free process. In fact, the proof used in [2] breaks down when m is too large, and this explains 
why a different approach is needed to obtain results that also hold in later steps. 



3 The proof 

Recall that when a pair becomes closed it has not yet been added to the graph produced by the 
H-fvee process, and furthermore can never be added in future steps, as this would create a copy 
of H. So, to prove that a certain set of edges F does not appear in the H-free process, it suffices to 
show that already after the first m steps, at least one of its edges is closed. 

Proof of Theorem O For the sake of concreteness we prove the theorem with 

r 16 13 • 2^ 1 , , f 1 1 1 1 

c:=max<^— ,- — -r ^ and d:=mm<^-, e,——-2e,l'> , (7) 

where e and // are chosen as in Section 12.11 and Ph is defined as in Theorem [6j Given i < m and 
F Q (^2')' denote the event that F n C(i) = 0. Let £m denote the event that there 
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exists A C [n] and F C (^) with 1 < |A| < n'^ and |F| = [c|^|] for which Sp^^n holds. Note that 
if £m fails, then e{A) < c\A\ for all A C [n] with 1 < |A| < n'^, since, as discussed above, none 
of the corresponding edge sets F can appear in the H-bee process. So, because Hm holds whp by 
Theorem [Gj in order to complete the proof it suffices to show 

F[£^ A n„^] = 0(1) . (8) 

Fix A C [n] and F C [fj with 1 < \A\ = a < n'^ and |F| = \ca]. With foresight, let Opii) ^ 0(i) 
denote the open pairs which would close at least one pair of F if chosen as the next edge ej+i. Then 

n£F,m A n,n] = IP[^F,m/2 A 71^/2] H ^[£F,^+l A [ £F,^ A 

m/2<i<m— 1 , , 

< Jl P[ei+i ^ I ^F.^AT^i] . 

m/2<i<m—l 

Note that A Hi depends only on the first i steps, so given this, the process fails to choose ej+i 
from Opii) with probability 1 — |Oi;'(i)|/|0(z)|. With this in mind, we claim that in order to prove 
([8]) it suffices to show that for m/2 < i < m, whenever £F,i A T-Li holds we have 

, ^ , ^, 13a log n , ^ , , , , , 

Of i > —\0{^\ ■ 10 

m 

Indeed, combining Q and (fTOj) . using the inequality 1 — x < we deduce, say, 

F[£F,m A Ura] < e-^"'°S" = n'^" . (11) 

Now, taking a union bound over all choices of A and F, we obtain 



^a^2(ca+l)^- 



l<a<n'* l<a<n'* 

Using 1 < a <n'^ and ([7]), i.e. d < min{l/c, 1}, we see that 

which readily implies P[£^m A?^m] = o(l). To sum up, assuming ([TO]) we have established the desired 
formula ([8]). 

In the remainder we prove (llOp for m/2 < i < m, whenever £p^i A "Hj holds. Since uv F f] 0{i) 
would belong to C{i + 1) iff ej+i G Ct,^(z), we deduce Opii) = U™GFno{j) Therefore 

\OF{i)\> - E |o«.«nCnvWI • (12) 

raG-FnO{i) MD,«'D'G-FnO(i) 

Observe that for a < n'^ we have a > pa^v?^ by definition of p and d, cf. ([2]) and ([7]). Recall that 
([6]) holds on ^^j. So, using |F| = \ca\ and d?]), i.e. c > 16e~^, we deduce 

e{A) < max{8e"^a,pa^n2^} = 8e"^a < c/2 • a < |F|/2 . (13) 

Recall that whenever £p^i holds, then we have FnC{i) = 0, which in turn implies FriO{i) = F\E{i). 
So, using F C ('^) and (fT3l) . we see that 

|F n 0{i)\ = \F\ E{i)\ > \F\ - e{A) > \F\/2 . (14) 
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Note that and ([5|) hold on Hi. Substituting these estimates as well as ()14p into (jl2p . we have 

\OF{i)\ > \F\/2 ■ ^H{2tf"'\{t)p'^ - \F\^ ■ n-^l^^v'^ ■ 

Observe that ([1]) and ([2]) imply q(t) > for i < m. So, since |F| < 3cn'^, by d?]) we see that 
\F\n~^^'^'^ < 3cn~^ < 3cq{t). Using that t = i/{v?p) = for i > m/2, we crudely obtain 

\OF{i)\ > \F\I3h2^»~^ ■f»-'^q{t)p-^ . 

Note that on T-Li we furthermore have q{t) > \0{i)\/'n? by ([3]). So, writing t = i/{'n?p) and using 
\F\ > ca as well as ([7]), for m/2 < i < m = fin p{log deduce 

|0^(z)| > \F\f^H2^-- \ q{t) > \F\Ph2^«- \ 'l"^' \0{i)\ 



To summarize, we have established (jlOp and, as explained, this completes the proof. □ 

The main difficulty in the above prove is the estimate (jlOp . A similar bound is implicit in the 
approach of Gerke and Makai [8], but their argument is tailored towards the (simpler) Cs-free case. 
By contrast, our proof exploits a combinatorial characterization of Opii) for the more general H- 
free process, where H is strictly 2-balanced, which in turn enables us to prove stronger results (e.g. 
we allow for |^| < n'^ instead of constant size). In fact, we can also obtain the asymptotic size of 
|Oi?(i)| using the results in [2]; we leave these details to the interested reader. Furthermore, our 
approach avoids a subtle conditioning issue: in [8] the authors condition on events that depend on 
the first m steps, but seem to assume that after the first i steps, with i < m, the next edge e^+i is 
still chosen uniformly at random from 0{i). 
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